Integrating multiple knowledge sources for word hypotheses graph interpretation
نویسندگان
چکیده
We present an integrated approach for the interpretation of word hypotheses graphs (WHGs) using multiple knowledge sources. Commonly, di erent knowledge sources in speech understanding are applied sequentially. Typically, speech understanding systems, such as the Verbmobil speech-to-speech translation system, rst use a word recognizer to determine word hypotheses, only based on acoustic and language model (LM) information. The resulting word sequences or WHGs are then segmented according to syntactic and/or prosodic information. Finally, these segments are interpreted by a parser or a stochastic process. Thus, it is impossible to use the knowledge of the syntactic-prosodic process, the parser or any other subsequent process to nd the best word sequence. In our new approach we use acoustic, prosodic and LM information to determine the best word chain, to detect syntactic/prosodic/pragmatic phrase boundaries and to classify dialog acts in one integrated search procedure, based on a WHG or a word lattice.
منابع مشابه
Integrating WordNet for Multiple Sense Embeddings in Vector Semantics
Popular distributional approaches to semantics allow for only a single embedding of any particular word. A single embedding per word conflates the distinct meanings of the word and their appropriate contexts, irrespective of whether those usages are related or completely disjoint. We compare models that use the graph structure of the knowledge base WordNet as a post-processing step to improve v...
متن کاملImproving the Readability of ASR Results for Lectures Using Multiple Hypotheses and Sentence-Level Knowledge
This paper presents a novel method for improving the readability of automatic speech recognition (ASR) results for classroom lectures. Because speech in a classroom is spontaneous and contains many ill-formed utterances with various disfluencies, the ASR result should be edited to improve the readability before presenting it to users, by applying some operations such as removing disfluencies, d...
متن کاملIntegrating multiple knowledge sources for improved speech understanding
In spoken dialog systems it is often the case that the sentence produced by the decoder with the highest recognition probability may not be the best choice for extracting the intended concepts. Lower ranking hypotheses may present better alternatives. In this paper, we show how to integrate multiple knowledge sources for the decision of selecting one of these hypotheses. A scoring schema combin...
متن کاملAn Efficient Decoding Approach for Dialogue Systems
We present a decoder for dialogue systems having both automatic speech recognition (ASR) and natural language (NL) understanding. Accurate and fast decoding of spoken utterances is a key to speech understanding. A good set of multiple word hypotheses produced by a decoder is also crucial for flexible integration of knowledge sources for language understanding. We introduce three contributions t...
متن کاملThe SuperARV Language Model: Investigating the Effectiveness of Tightly Integrating Multiple Knowledge Sources
A new almost-parsing language model incorporating multiple knowledge sources that is based upon the concept of Constraint Dependency Grammars is presented in this paper. Lexical features and syntactic constraints are tightly integrated into a uniform linguistic structure called a SuperARV that is associated with a word in the lexicon. The SuperARV language model reduces perplexity and word erro...
متن کامل